Evaluation of potential power gain with imputed genotypes in genome-wide association studies.
نویسندگان
چکیده
BACKGROUND With the beginning of the era of genome-wide association studies methods to obtain 'in silico' genotypes have gained importance. In this context, an evaluation of genome-wide power levels of current marker panels and the power gain achievable with imputed genotypes are of high interest. METHODS Power for single-marker analysis of imputed genotypes is evaluated via a simulation study based on HapMap data. Power values for genome-wide significance of marker panels of 1,000,000 SNPs are considered for small effect sizes typical of common diseases and large case-control samples. In order to evaluate the performance of imputing, we consider a method that is conceptually related to previous approaches. We introduce various modifications which together lead to an alternative implementation of the imputation idea. In particular, a Monte-Carlo (MC) simulation method for association testing of imputed markers is introduced. RESULTS We show that the incorporation of imputed genotypes can lead to a substantial power gain for common disease variants if the training sample is large enough. In addition, we show that the MC approach is valuable to for validating association results obtained with imputed genotypes. DISCUSSION Our simulation study also shows that even denser marker panels than those currently available are needed when sample size is limited. We thus expect that full genome SNP panels will lead to the identification of additional disease variants in the future. Until then, it is desirable that large and ethnically matched training samples genotyped on dense marker panels are available in each country.
منابع مشابه
Extending rare-variant testing strategies: analysis of noncoding sequence and imputed genotypes.
Next Generation Sequencing Technology has revolutionized our ability to study the contribution of rare genetic variation to heritable traits. However, existing single-marker association tests are underpowered for detecting rare risk variants. A more powerful approach involves pooling methods that combine multiple rare variants from the same gene into a single test statistic. Proposed pooling me...
متن کاملGenotypic discrepancies arising from imputation
The ideal genetic analysis of family data would include whole genome sequence on all family members. A strategy of combining sequence data from a subset of key individuals with inexpensive, genome-wide association study (GWAS) chip genotypes on all individuals to infer sequence level genotypes throughout the families has been suggested as a highly accurate alternative. This strategy was followe...
متن کاملFrequentist tests of association for imputed genotypes
Servin and Matthews [17] proposed looking for associations between phenotypes and both typed and untyped SNPs, by using a reference panel to infer the alleles of the untyped SNPs. Since then, a number of GWA studies have reported p-values for both typed and untyped SNPs. However, results of Almeida et al [1] indicate that using imputed genotype data can lead to increased type I error. We discus...
متن کاملThe value of relatives with phenotypes but missing genotypes in association studies for quantitative traits.
The additional statistical power of association studies for quantitative traits was derived when ungenotyped relatives with phenotypes are included in the analysis. It was shown that the extra power is a simple function of the coefficient of additive genetic relationship and the phenotypic correlation coefficient between the genotyped and ungenotyped relatives. For close relatives, such as pair...
متن کاملGenome Wide Association Studies, Next Generation Sequencing and Their Application in Animal Breeding and Genetics: A Review
Recently genetic studies have been revolutionized by next generation sequencing (NGS) technology, and it is expected that the use of this technology will largely eliminate defects in the methods of association studies. The NGS technology is becoming the premier tool in genetics. However, at the moment the use of this method is limited especially in the livestock due to high cost and computation...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Human heredity
دوره 68 1 شماره
صفحات -
تاریخ انتشار 2009